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Abstract 

A construction of Partial Maximum Distance Separable (PMDS) and Sector-Disk 
(SD) codes extending RAID 5 with two extra parities is given, solving an open problem. 
Previous constructions relied on computer searches, while our constructions provide a 
theoretical solution to the problem. 
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1 Introduction 

Consider an m x n array whose entries are elements in a finite field GF(2 b ) jl] (in general, we 
could consider a field GF(p b ), p a prime number, but for simplicity, we constrain ourselves 
to binary fields). The n columns represent storage devices like SSDs, HDDs or tapes. The 
arrays (often called stripes also) are repeated as many times as necessary. In order to protect 
against a device failure, a RAID 4 or RAID 5 type of scheme, in which one of the devices is 
the XOR of the other ones, can be implemented. During reconstruction, the failed device is 
recovered sector by sector. The problem with RAID 5 is, if an additional sector is defective 
in addition to the one corresponding to the failed device, data loss will occur. A solution to 
this problem is using a second device for parity (RAID 6), allowing for recovery against two 
failed devices. However, this scheme may be wasteful, and moreover, it is unable to correct 
the situation in which in addition to the sector corresponding to the failed disk, we have two 
extra failed sectors in the row (we always assume that failed sectors can be identified, either 
by CRC or by other means, so the correcting scheme is an erasure correcting scheme). In 
order to overcome this problem, the so called Partial MDS (PMDS) codes pQ and Sector-Disk 
(SD) codes [5] were created. Very similar codes were presented in [3]. 
We start by giving the definition of PMDS and SD codes. 



1 



l o ^_o__o_ J_J^J_J?_JL 1 ° 1 E ° 

E~~ririr~r irirTirir ~e~ # e i 

A^^AX! !aI?LXAA! "i i ti^ t 

~~^~T~^ ttti r ~i~T l £ T 



Figure 1 : A 4 x 5 array with different types of failures 



Definition 1.1 Let C be a linear [mn,m(n — r) — s] code over a field such that when 
codewords are taken row-wise as m x n arrays, each row belongs in an [n,n — r,r + 1] MDS 
code. Then, 

1. C is an (r; s) partial-MDS (PMDS) code if, for any (si, S2, . . . , St) such that each Sj > 1 
and Y?j=i s j — s ) an d f° r an Y H,^, ■ ■ ■ ,h such that < < i 2 < • • • < i t < m — 1> C 
can correct up to Sj + r erasures in each row ij, 1 < j < t, of an array in C. 

2. C is an (r; s) sector-disk (SD) code if, for any li, l 2 , . . . , l r such that < l\ < I2 < ■ ■ ■ < 
l r < n — 1, for any (s l3 s 2 , ■ ■ ■ , s t) such that each Sj > 1 and Sj=i s j = s ; an d f° r an Y 
ii, i 2 , . . . ,it such that < %\ < i 2 < ■ ■ ■ < it < m — 1, C can correct up to Sj + r erasures 
in each row ij, 1 < j < t, of an array in C provided that locations lx,l 2 , ■ ■ - l r in each 
of the rows ij have been erased. 

SD codes satisfy a weaker condition than PMDS codes, but they may be sufficient in 
most applications. The case of (r; 1) PMDS codes has been solved in [T]. In this paper, we 
address the case of (1;2) PMDS and SD codes. Figure [T] illustrates the difference between 
(1;2) PMDS and SD codes for a 4 x 5 array (i.e., a code of length 20): the array in the left 
depicts a situation that can be handled by a (1;2) PMDS but not by a (1;2) SD code; the 
second and the fourth rows have two erasures (denoted by E) each and there is no column 
containing two of these erasures. The array in the middle illustrates a situation in which 
the second and fourth rows have two erasures each, but the second column contains two of 
those erasures, which correspond to a total failure of the second device. Individual erasures 
in a row can always be handled by single parity (like in the first and the third rows). This 
situation can be handled by both (1;2) PMDS and SD codes. Finally, the array in the right 
shows the situation of three erasures in a row, and at most one in the remaining ones. This 
situation can also be handled by both (1;2) PMDS and SD codes (but not by RAID 6). 

In the next section we give the construction of both (1;2) PMDS and SD codes. From now 
on, when we say PMDS or SD codes, we refer to (1;2) PMDS or SD codes. 

2 Code Construction 

Consider the field GF(2 b ) and let a be an element in GF(2 b ). The (multiplicative) order of 
a, denoted O(a), is the minimum £, < £, such that or — 1. If a is a primitive element [1], 
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then O(a) = 2 b -l. To each element a 6 GF(2 b ), there is an associated (irreducible) minimal 
polynomial [I] that we denote fa( x )- 
Let a G GF(2 b ) and mn < 0(a). Consider the (m + 2) x mn parity-check matrix 

f Co £l • • • £n-l | Cn+1 ■ ■ ■ Q.2n-1 | • ■ ■ | C(m— l)n C(m— l)ri+l ■ ■ ■ Qmn-1 ) (1) 

where Cj denotes a column of length m+2, and, if e, denotes anmxl vector whose coordinates 
are zero except for coordinate i, which is 1, then, for < i < m — 1, 



Qin: Qin+li • • ■ ) £(i+l)n— 1 



a 



a 



in+l 



a 



in+j 



a 



(i+l)n-l 



\ a 2in a 2in ~ l 



a 2in ~ j Q;(2i-l)n+l 



(2) 



We denote as C^°\m, n; fa( x )) the [mn, m{n — 1) 
matrix is given by (CQ) and ([2]). 



2] code over GF(q) whose parity-check 



Example 2.1 Consider the finite field GF (16) and let a be a primitive element, i.e., O(a) 
Then, the parity-check matrix of C^(3, 5; fa( x )) is given by 
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:ly, the parity-check matrix of 


(5,3; 
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Let us point out that the construction of this type of codes is valid also over the ring 
of polynomials modulo M p (x) — 1 + x + ■ ■ ■ + x p_1 , p a prime number, as done with the 
Blaum-Roth (BR) codes [2]. In that case, O(a) =p, where a p ~ l = 1 + a + ■ • • + a p ~ 2 . The 
construction proceeds similarly, and we denote it C^ -* (m, n; M p (x)). Utilizing the ring modulo 
M p (x) allows for XOR operations at the encoding and the decoding without look-up tables 
in a finite field, which is advantageous in erasure decoding [2]. It is well known that M p (x) 
is irreducible if and only if 2 is primitive in GF(p) [1]. 
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Example 2.2 Consider the ring of polynomials modulo Mn(x) and let a be an element in 
the ring such that a 16 = 1 + a + • • • + a 15 , thus, 0(a) = 17 (notice, M 17 (x) is reducible). 
Then, the parity-check matrix of C^(4, 4; Mn(x)) is given by 
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We have the following theorem: 

Theorem 2.1 Codes (m,n; f a (x)) and C^°\m, n; M p (x)) are SD codes. 

Proof: According to Definition 11.11 we have to prove first that 3 erasures in the same row 
will be corrected. Based on the parity-check matrix of the code, this will happen if and only 
if, for any < i < m — 1 and < jo < ji < J2 < n — 1 , 



det 



/ 1 1 1 

a in+j a in+n a in+j 2 ^ q 

\ a 2in ~ jo a 2in ~ jl a 2in ~ j2 



But the determinant of this 3x3 matrix can be easily transformed into a Vandermonde 
determinant on ot?°, aP 1 and a j2 times a power of a, so it is invertible in a field and also in 
the ring of polynomials modulo M p (x) [2]. 

Next we have to prove that if we have two erasures in locations i and j of row £, say, 
0<z <j <77. — 1, and two erasures in locations i' and j' of row £', < i' < f < n — 1, 
<£<£'< m — 1, such that, either i = i', i =j', j' = i or j = j', then 
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After some row manipulation, the inequality above holds if and only if 
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1 © a J * is invertible in GF(q) since 1 < j—i < 0(a), but the same is true in the polynomials 
modulo M p (x) [2], thus, the inequality above is satisfied if and only if 

det ( „-i a A)n-f ) ^ 0- 

Assume that this determinant is 0. Redefining £^£' — £, then 1 < £ < m — 1 and we have 

a = a J . 

We will show that this is not possible. Assume that % — %' . Then, 

Assume that f > j. Then, £n = j' — j, a contradiction since j' — j < n — 1 and n < in < 
mn < 0(a). 

So, assume f < j. Then, £n = 0(a) + j' — j. But this also gives a contradiction, since 
In < mn — n < O(a) — n, and 0(a) + f — j > O(a) — n + 1. 
The cases i = j', j' = i and j = j' are handled similarly □ 



Next we show how to construct PMDS codes. 

Let a G GF(2 b ) and 2mn < 0(a). Consider the (m + 2) x mn parity-check matrix given 
by (0Q) and, for < i < m — 1, 
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(3) 



We denote the [mn, min — 1) — 2] code over GF(q) whose parity-check matrix is given 
by ([I]) and ([3]) as C^^m, n; fa{x)). The same can be done with the ring of polynomials 
modulo M p (x), in which case we denote the code C^\m,n; M p (x)). 



Example 2.3 As in Example 12.21 consider the ring of polynomials modulo Mu(x) and let 
a be an element in the ring such that 0(a) = 17 and a 16 = 1 + a + ■ • • + a 15 . Then, the 
parity-check matrix of 0^(2, 4; Mn(x)) is given by 
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Theorem 2.2 Codes C {1) (m, n; a; q) and C (1) (m, n; M p (x)) are PMDS codes. 



Proof: As in Theorem 12.11 we have to prove first that three erasures in the same row will 
always be corrected. 

Based on the parity-check matrix of the code, this will happen if and only if, for any 
< i < m — 1 and < jo < j\ < < n — 1, 
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Again, the determinant of this 3x3 matrix can be transformed into a Vandermonde 
determinant on a- 70 , a- 71 and a- 72 times a power of a, so it is invertible in a field and also in 
the ring of polynomials modulo M p (x). 

Next we have to prove that if we have two erasures in locations i and j of row £, say, 
0<i<i<w — 1, and two erasures in locations i' and j' of row < %' < f < n — 1, 
0<£<£'<m-l, then 
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After some row manipulation, the inequality above holds if and only if 
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Again, 1 ©a- 7 * is invertible in GF(q) and in the ring of polynomials modulo M p (x), thus, 
the inequality above is satisfied if and only if 
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Assume that this determinant is 0. Redefining I ■h- £' — £, then 1 < £ < m — 1 and we have 
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But this is not possible. In effect, assume first that i' + f > i + j. Then, since 2n < 2£n < 



0(a), we would have 2£n — + j' 



j, a contradiction since i' + f 



j < 2(n — 1). 



So, assume i'+f < i + j- Then, 2£n = O{o) + + j' — i — j. This also gives a contradiction, 



since 2£n < 2mn - 2n < 0(a) - 2n, and 0(a) + %' + j 1 



J > 0{a) -2n + 2. 



□ 
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3 Conclusions 



We have presented constructions of PMDS and SD codes extending RAID 5 with two ex- 
tra parities, solving an open problem since previous constructions were based on computer 
search. It is an open problem to extend the results to more parities. 
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